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t/3 . Abstract 

This paper makes two contributions towards determining some well-studied optimal con- 
I , stants in Fourier analysis and high-dimensional geometry. 

■ 1. It has been known since 1994 |GL94j that every linear threshold function has squared 
, Fourier mass at least 1/2 on its degree-0 and degree-1 coefhcients. Denote the minimum 

CN| ■ such Fourier mass by W-^[LTF], where the minimum is taken over all n-variable linear 

I threshold functions and all n > 0. O'Donnell [0'D12| has conjectured that the true value 

!>!■ of W^i[LTr] is 2/7r. 

. We make progress on this conjecture by proving that W--'^[LTF] > 1/2 + c for some 

I absolute constant c > 0. The key ingredient in our proof is a "robust" version of the 

. • . well-known Khintchine-Kahane inequality in functional analysis, which we believe may be 

of independent interest. 

■ 2. We give an algorithm with the following property; given any 77 > 0, the algorithm runs in 
^ . time 2P°'y(^/'') and determines the value of W--'^[LTF] up to an additive error of ±77. We 

give a similar 2'^^ '''''-time algorithm to determine Tomaszewski's constant to within an 
additive error of ±77; this is the minimum (over all origin-centered hyperplanes H) fraction 
of points in { — 1, 1}" that lie within Euclidean distance 1 of H. Tomaszewski's constant 
is conjectured to be 1/2; lower bounds on it have been given by Holzman and Kleitman 
|HK92j and independently by Ben-Tal, Nemirovski and Roos jBTNR02] . Our algorithms 
combine tools from anti-concentration of sums of independent random variables, Fourier 
analysis, and Hermite analysis of linear threshold functions. 
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1 Introduction 



This paper is inspired by a belief that simple mathematical objects should be well understood. We 
study two closely related kinds of simple objects: n-dimensional linear threshold functions f{x) = 
sign(u; ■ x — 9), and n-dimensional origin-centered hyperplanes H = {x £ M" : w- x = 0}. O'Donnell 
[0'D12j and Tomaszewski |Guy86| have posed the question of determining two universal constants 
related to halfspaces and origin-centered hyperplanes respectively; we refer to these quantities 
as "O'Donnell's constant" and "Tomaszewski 's constant." While these constants arise in various 
contexts including uniform-distribution learning and optimization theory, little progress has been 
made on determining their actual values over the past twenty years. In both cases there is an easy 
upper bound which is conjectured to be the correct value; Gotsman and Linial |GL94| gave the 
best previously known lower bound on O'Donnell's constant in 1994, and Holzmann and Kleitman 
|HK92j gave the best known lower bound on Tomaszewski's constant in 1992. 

We give two main results. The first of these is an improved lower bound on O'Donnell's constant; 
a key ingredient in the proof is a "robust" version of the well-known Khintchine-Kahane inequality, 
which we believe may be of independent interest. Our second main result is a pair of algorithms 
for computing O'Donnell's constant and Tomaszewski's constant up to any prescribed accuracy. 
The first algorithm, given any i] > 0, runs in time 2"^°^^^^^^^ and computes O'Donnell's constant up 
to an additive r], and the second algorithm runs in time 2^''°'^'^''''' and has the same performance 
guarantee for Tomaszewski's constant. 



1.1 Background and problem statements. 

First problem: low-degree Fourier weight of linear threshold functions. A linear threshold 
function, henceforth denoted simply LTF, is a function / : { — 1, 1}" — > 1} of the form f{x) = 
sign(it) ■ X — 9) where w G M"" and 6 € M (the univariate function sign : M — t- R is sign(z) = 1 for 
z > and sign(2:) = — 1 for z < 0). The values wi, . . . ,Wn are the weights and 6 is the threshold. 
Linear threshold functions play a central role in many areas of computer science such as concrete 
complexity theory and machine learning, see e.g. [DGJ"'"10] and the references therein. 

It is well known |BKS991 IPer04j that linear threshold functions are highly noise-stable, and 
hence they must have a large amount of Fourier weight at low degrees. For / : { — 1,1}" — )• M 
and k E [0,n] let us define W^[/] = Esc[nm=kf^iS) and W^''[f] = Ej=oWn/]; we will be 
particularly interested in the Fourier weight of LTFs at levels and 1. More precisely, for n £ N 
let LTF„ denote the set of all n-dimensional LTFs, and let LTF = U^^^LTF^. We define the 
following universal constant: 

Definition 1. 

O'Donnell (see the Conjecture following Theorem 2 of Section 5.1 of |0'D12j ) has conjectured 
that W^i[LTF] = 2/tt, and hence we will sometimes refer to W-^[LTF] as "O'Donnell's constant." 
As n — 7- oo, the n-variable Majority function shows that W-^[LTF] < 2/tt. Gotsman and Linial 
|GL94j observed that W-^[LTF] > 1/2 but until now no better lower bound was known. We note 
that since the universal constant W-^[LTF] is obtained by taking the infimum over an infinite set, 
it is not a priori clear whether the computational problem of computing or even approximating 
W^i[LTF] is decidable. 

Jackson [Jac06j has shown that improved lower bounds on W-^[LTF] translate directly into 
improved noise-tolerance bounds for agnostic weak learning of LTFs in the "Restricted Focus of 
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Attention" model of Ben-David and Dichterman |BDD98j . Further motivation for studying W-^ [/] 
comes from the fact that W^[/] is closely related to the noise stability of / (see |0'D12j ). In 
particular, if NSp[/] represents the noise stability of / when the noise rate is (1 — p)/2, then it is 
known that 

dNSp[/] 



dp 



= wi[/]. 

p=0 



This means that for a function / with E[/] = 0, we have NSp[/] — > p • W-^[/] as p — > 0. Thus, at 
very large noise rates, W^[/] quantifies the size of the "noisy boundary" of mean-zero functions /. 

Second problem: how many hypercube points have distance at most 1 from an origin- 
centered hyperplane? For n G N and n > 1, let S"""*^ denote the n-dimensional sphere S"^"*^ = 
{w G M" : \\'w\\2 = !}• Each unit vector w G W"^"^ defines an origin-centered hyperplane = 
{x G M" : w • X = 0}. Given a unit vector w G S"~^, we define T^{w) G [0,1] to be T{w) = 
Pr^g|_x i|7i • x\ < 1], the fraction of hypercube points in { — 1,1}" that lie within Euclidean 
distance 1 of the hyperplane Hw 

Let us now define S to be Uri>iS"~^. We define the following universal constant, which we call 
"Tomaszewski's constant:" 

Definition 2. 

T(S) =^ inf T{w) = inf T(S'"-i), where T(S"-i) =^ inf T(u;). 

Tomaszewski |Guy86| has conjectured that T(S) = 1/2. The main result of Holzman and 
Kleitman |HK92] is a proof that 3/8 < T(S); the upper bound T(S) < 1/2 is witnessed by the 
vector w = (l/\/2, l/\/2)- As noted by Holzman and Kleitman, the quantity T(§) has a number 
of appealing geometric and probabilistic reformulations. Similar to O'Donnell's constant, since the 
universal constant T(§) is obtained by taking the infimum over an infinite set, it is not immediately 
evident that any algorithm can compute or approximate T(S). 

Apart from being an interesting quantity in its own right, Tomaszewski's constant arises in 
a range of contexts in optimization theory, see e.g. |So09l IBTNR02] . In fact, the latter paper 
proves a lower bound of 1/3 on the value of Tomaszewski's constant independently of [HK92) . and 
independently conjectures that the optimal lower bound is 1/2. 



1.2 Our results. 

A better lov^er bound for O'Donnell's constant W-^[LTF]. Our first main result is the 
following theorem: 

Theorem 3 (Lower Bound for O'Donnell's constant). There exists a universal constant c' > 
such that W^i[LTF] > ^ + c'. 

This is the first improvement on the |GL94j lower bound of 1/2 since 1994. We actually give 
two quite different proofs of this theorem, which are sketched in the "Techniques" subsection below. 

An algorithm for approximating O'Donnell's constant W-^[LTF]. Our next main result 
shows that in fact there is a finite-time algorithm that approximates O'Donnell's constant up to 
any desired accuracy: 

^Whenever we speak of "an algorithm to compute or approximate" one of these constants, of course what we 
reaUy mean is an algorithm that outputs the desired value together with a proof of correctness of its output value. 
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Theorem 4 (Approximating O'Donnell's constant). There is an algorithm that, on input an ac- 
curacy parameter e > 0, runs in time 2^"^^^^^^^ and outputs a value such that 

[LTF] < r, < [LTF] + e. (1) 

An algorithm for approximating Tomaszewski's constant T(S). Our final main result is a 
similar-in-spirit algorithm that approximates Tomaszewski's constant up to any desired accuracy: 

Theorem 5 (Approximating Tomaszewski's constant). There is an algorithm that, on input an 
accuracy parameter e > 0, runs in time 2^''°'^ ^ and outputs a value such that 

T(§) < r, < T(S) + e. (2) 

1.3 Our techniques for Theorem [3l lower-bounding O'Donnell's constant W-^[LTF] 

It is easy to show that it suffices to consider the level- 1 Fourier weig ht of LTFs that have 
threshold 9 = 0; we confine our subsequent discussion to such zero-threshold LTFs. To explain our 
approaches to lower bounding W-^[LTF], we recall the essentials of Gotsman and Linial's simple 
argument that gives a lower bound of 1/2. The key ingredient of their argument is the well-known 
Khintchine-Kahane inequality from functional analysis: 

Definition 6. For a unit vector w € S"""*^ we define \^{w) E^g|_i x}n [\w ■ x\] to be the 
"Khintchine-Kahane constant for w. " 

The following is a classical theorem in functional analysis (we write to denote the unit vector in 
R" with a 1 in coordinate z): 

Theorem 7 (Khintchine-Kahane inequality, |Sza76j ). For w G S" any unit vector, we have K(t(;) > 
I/V2, with equality holding if and only ^/ ^ = (icj it Cj) for some i ^ j € [n]. 

Szarek |Sza76j was the first to obtain the optimal constant l/\/2, and subsequently several 
simplifications of his proof were given |Haa821 ITom871 IL094j : we shall give a simple self-contained 
proof in Section [3. II below, which is quite similar to Filmus's |Fill2j translation of the |L094j proof 
into "Fourier language." With Theorem [7] in hand, the Gotsman-Linial lower bound is almost 
immediate: 

Proposition 8 ([GL94]). Let f : {-1, 1}" {-1, 1} he a zero-threshold LTF f{x) = sign(w • x) 
where w G M" has \\w\\2 = 1. Then Wi[/] > (K(u;))^ . 

Proof. We have that 

n ^ n ^ In 

K(«;) = E,[/(x)(i« • x)] = E f{^m <JE • J E ^? = v/WMTJ 

i=l y i=l y i=l 

where the first equality uses the definition of /, the second is Plancherel's identity, the inequality 
is Cauchy-Schwarz, and the last equality uses the assumption that is a unit vector. □ 

First proof of Theorem [3l A "robust" Khintchine-Kahane inequality. Given the strict 
condition required for equality in the Khintchine-Kahane inequality, it is natural to expect that if 
a unit vector w G is "far" from (icj ± Cj), then K.{w) should be significantly larger than 

I/V2. We prove a robust version of the Khintchine-Kahane inequality which makes this intuition 
precise. Given a unit vector w G define d{w) to be d{w) = min \\w — w*\\2, where w* ranges 

over all 4(2) vectors of the form -^(ibei it Cj). Our "robust Khintchine-Kahane" inequality is the 
following: 
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Theorem 9 (Robust Khintchine-Kahane inequality). There exists a universal constant c > such 
that for any w G S"~^, we have 

K('u;) > ^ +c-d{w). 
v2 

Armed with our robust Khintchine-Kahane inequality, the simple proof of Proposition[8]suggests 
a natural approach to lower-bounding W-^[LTF]. If w is such that d{w) is "large" (at least some 
absolute constant), then the statement of Proposition [8] immediately gives a lower bound better 
than 1/2. So the only remaining vectors w to handle are highly constrained vectors which are almost 
exactly of the form -^{±ei ± e^); one might naturally hope that the Cauchy-Schwarz inequality 
in the proof of Proposition [8] is not tight for such highly constrained vectors, and indeed this is 
essentially how we proceed (modulo some simple cases in which it is easy to bound W-^ above 1/2 
directly) . 

Second proof of Theorem [3} anticoncentration, Fourier analysis of LTFs, and LTF 
approximation. Our second proof of Theorem [3] employs several sophisticated ingredients from 
recent work on structural properties of LTFs [OSIH IMORSIO] . The first of these ingredients is 
a result (Theorem 6.1 of |QSllj ) which essentially says that any LTF /(x) = sign(w • x) can be 
perturbed very slightly to another LTF f'{x) = sign('u;' ■ x) (where both w and w' are unit vectors). 
The key properties of this perturbation are that (i) / and /' are extremely close, differing only 
on a tiny fraction of inputs in {—1, 1}"; but (ii) the linear form w' ■ x has some nontrivial "anti- 
concentration" when X is distributed uniformly over {—1, 1}*^, meaning that very few inputs have 
w' ■ X very close to 0. 

Why is this useful? It turns out that the anti-concentration of w' ■ x, together with results 
on the degree- 1 Fourier spectrum of "regular" halfspaces from [MORS 10] . lets us establish a lower 
bound on W-^[/'] that is strictly greater than 1/2. Then the fact that / and /' agree on almost 
every input in {—1, 1}" lets us argue that the original LTF / must similarly have W-^[/] strictly 
greater than 1/2. Interestingly, the lower bound on W-^[/'] is proved using the Gotsman-Linial 
inequality W-^[/'] > (K(it;))^; in fact, the anti-concentration of w' -x is combined with ingredients 
in the simple Fourier proof of the (original, non-robust) Khintchine-Kahane inequality (specifically, 
an upper bound on the total infiuence of the function i(x) = \w' ■ x\) to obtain the result. 

1.4 Our techniques for Theorem[4l approximating O'Donnell's constant W-^[LTF] 

As in the previous subsection, it suffices to consider only zero-threshold LTFs sign(tt; • x). Our 
algorithm turns out to be very simple (though its analysis is not): 

Let K = 0(e~^'^). Enumerate all variable zero-threshold LTFs, and output the value 
Fg =^min{W^[/] : / is a zero-threshold variable LTF.}. 

It is well known (see e.g. |MT94j ) that there exist 2®^-^^) distinct variable LTFs, and it is 
straightforward to confirm that they can be enumerated in output-polynomial time. Thus the 
above simple algorithm runs in time 2?°^^^^/^^; the challenge is to show that the value F^ thus 
obtained indeed satisfies Equation ([T]). 

A key ingredient in our analysis is the notion of the "critical index" of an LTF /. The critical 
index was implicitly introduced and used in |Ser07j and was explicitly used in [DS091 iDGJ+lOl 
lOSm IDDFS12] and other works. To define the critical index we need to first define "regularity" : 
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Definition 10 (regularity). Fix r > 0. We say that a vector w = {wi, . . . ,Wn) G IK" is r-regular 
if maxjgj„] \wi\ < t\\w\\ = t\J + • • • + w\. A linear form w ■ x is said to be r-regular if w is 
T-regular, and similarly an LTF is said to be r-regular if it is of the form sign(tf; ■ x — 9) where w 
is T-regular. 

Regularity is a helpful notion because if w is r-regular then the Berry-Esseen theorem tells us 
that for uniform x € {—1, 1}", the linear form w ■ x is "distributed like a Gaussian up to error r." 
This can be useful for many reasons (as we will see below). 

Intuitively, the critical index of w is the first index i such that from that point on, the vector 
{wi,Wi-^.l, . . . , Wn) is regular. A precise definition follows: 

Definition 11 (critical index). Given a vector w G M" such that \wi\ > ••• > \wn\ > 0, for 
k G [n] we denote by the quantity \jYl^=k '^f- define the r-critical index c{w, r) of w as the 
smallest index i £ [n] for which \wi\ < t ■ ai. If this inequality does not hold for any i £ [n], we 
define c{w,t) = oo. 

Returning to Theorem U since our algorithm minimizes over a proper subset of all LTFs, it 
suffices to show that for any zero-threshold LTF / = sign{w-x), there is a X- variable zero-threshold 
LTF g such that 

W^bl-Wi [/]<£. (3) 

At a high level our proof is a case analysis based on the size of the (5-critical index c{w, 5) 
of the weight vector w, where we choose the parameter 5 to he 5 = poly(e). The first case is 
relatively easy: if the 5-critical index is large, then it is known that the function / is very close to 
some i^-variable LTF g. Since the two functions agree almost everywhere, it is easy to show that 
\W^[f] -W^[g]\<e as desired. 

The case that the critical index is small is much more challenging. In this case it is by no means 
true that / can be well approximated by an LTF on few variables - consider, for example, the 
majority function. We deal with this challenge by developing a novel variable reduction technique 
which lets us construct a poly(l/e)-variable LTF g whose level-1 Fourier weight closely matches 
that of /. 

How is this accomplished? The answer again comes from the critical index. Since the critical 
index c{w, 5) is small, we know that except for the "head" portion Yli^i^^ ^ WiXi of the linear 
form, the "tail" portion J27=c{w 5) '^i^i of the linear form "behaves like a Gaussian." Guided by 
this intuition, our variable reduction technique proceeds in three steps. In the first step, we replace 
the tail coordinates xt = {xc(w,S)-, ■ ■ ■ j^n) by independent Gaussian random variables and show 
that the degree- 1 Fourier weight of the corresponding "mixed" function (which has some itl-valued 
inputs and some Gaussian inputs) is approximately equal to W^[/]. In the second step, we replace 
the tail random variable wt ■ Gt, where Gt is the vector of Gaussians from the first step, by a single 
Gaussian random variable G, where G ~ AA(0, Hi^TlP)- We show that this transformation exactly 
preserves the degree- 1 weight. At this point we have reduced the number of variables from n down 
to c{w,5) (which is small in this case!), but the last variable is Gaussian rather than Boolean. As 
suggested by the Central Limit Theorem, though, one may try to replace this Gaussian random 
variable by a normalized sum of independent ±1 random variables "^ff^i Zi / y/M . This is exactly 
the third step of our variable reduction technique. Via a careful analysis, we show that by taking 
M = poly(l/e), this operation preserves the degree-1 weight up to an additive e. Combining all 
these steps, we obtain the desired result. 
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1.5 Our techniques for Theorem[5t approximating Tomaszewski's constant T(S) 

The first step of our proof of Theorem is similar in spirit to the main structural ingredient of our 
proof of Theorem [3J we show (Theorem [69]) that given any e > 0, there is a value K^^ = poly(l/e) 
such that it suffices to consider linear forms w ■ x over i^e-dimensional space, i.e. for any n S N we 
have 

T(§"-i) < T(S^'-i) < T(§"-i) + e. 

Similar to the high-level outline of Theorem [H our proof of again proceeds by fixing any w G S"~^ 
and doing a case analysis based on whether the critical index of w is "large" or "small." However, 
the technical details of each of these cases is quite different from the earlier proof. In the "small 
critical index" case we employ Gaussian anti-concentration (which is inherited by the "tail" random 
variable wtxt since the tail vector wt is regular), and in the "large critical index" case we use an 
anti-concentration result from |0S11| . 

Unlike the previous situation for O'Donnell's constant, at this point more work remains to be 
done for approximating Tomaszewski's constant. While there are only 2P°^y(^/^) many halfspaces 
over poly(l/e) many variables and hence a brute-force enumeration could cover all of them in 
2P°iy{i/'^) time for O'Donnell's constant, here we must contend with the fact that is an 

uncountably infinite set, so we cannot naively minimize over all its elements. Instead we take a 
dual approach and exploit the fact that while there are uncountably infinitely many vectors in 
§^^-^ there are only 2^" many hypercube points in { — 1, 1}^% and (with some care) the desired 
infimum over all unit vectors can be formulated in the language of existential theory of the reals. 
We then use an algorithm for deciding existential theory of the reals (see |Ren88j ) to compute the 
infimum. 

Discussion. It is interesting to note that determining Tomaszewski's constant is an instance of the 
well-studied generic problem of understanding tails of Rademacher sums. For the sake of discussion, 
let us define Tin(w,a) = Pr^^^_i^iyi[\w ■ x\ < a] and Tout('W^ia) = Pr^g|_i ijn • 3;| > a] where 
w G Further, let Tin(a) = inf^^igs Tin(w, a) and Tout(a) = inf«;g§ Tout('U^, a). Note that 

Tomaszewski's constant T(§) is simply Tjn(l). Much research effort has been expended on getting 
sharp estimates for Tin (a) and Tout(o) for various values of a (see e.g. |Pinl2[ IBen04] ) . As a 
representative example, Bentkus and Dzindzalieta jBD12j proved that 

for a € (1, v^]. Similarly, Pinelis |Pin94| proved that there is an absolute constant c > such that 
Tout(a) > 1 — c • where (/>(x) is the density function of the standard normal M{0, 1) (note that 
this beats the standard Hoeffding bound by a factor of I /a). 

On the complementary side, Montgomery-Smith |MS90| proved that there is an absolute con- 
stant c' > such that Tout(o) > . Similarly, Oleszkiewicz |01e96j proved that Tout(l) > 1/10. 
The conjectured lower bound on Tout(l) is 7/64 (see [HK94] ) . While we have not investigated 
this possibility in detail, we suspect that our techniques may be applicable to some of the above 
problems. Finally, we note that apart from being of intrinsic interest to functional analysts 
and probability theorists, the above quantities arise frequently in the optimization literature (see 
[HLNZOSj IBTNR02] ) . Related tail bounds have also found applications in extremal combintorics 
(see |AHS12j ^. 
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2 Mathematical Preliminaries 



2.1 Fourier analysis over { — 1, 1}" and influences 

We consider functions / : { — 1, 1}" — )• M (though we often focus on Boolean-valued functions 
which map to {—1, 1}), and we think of the inputs x to / as being distributed according to the 
uniform probability distribution. The set of such functions forms a 2"-dimensional inner product 
space with inner product given by {f,g) = Ex[f{x)g{x)]. The set of functions {xs)sc[n] defined 
by xs{x) = riieS-^* forms a complete orthonormal basis for this space. We will also often write 
simply xs for HieS^*- Given a function / : {—1,1}" — )• R we define its Fourier coefficients by 
f{S) = 'Ex[f{x)xs], and we have that f{x) = Ylsfi'^)^s- 

As an easy consequence of orthonormahty we have Plancherel's identity {f,g) = /('S')5(5'), 
which has as a special case Parseval's identity, ¥jx[f{x)'^] = '^s f{S)'^. From this it follows that 
for every / : {-1, 1}" {-1, 1} we have f{Sf = 1. Note that for / : {-1, 1}" ^ M we have 
that Var[/] = E,[/2(x)] - (E,[/])2 = ^5^0 f^iS)- 

Definition 12. Given f : {—1,1}" — ?• R and i G [n], the influence of variable i is defined as 
Infj(/) = Ea; [Vara;J/(x)]]. The total influence of f is defined as Inf(/) = X]"=i Infi(/). 

Fact 13. We have the identity Infj(/) = Yls^i f^i'^)' moreover, for f : { — 1, 1}" — ^ { — 1, 1} (i.e. 
Boolean-valued), it holds Infj(/) = Pr^[/(x*~) 7^ /(x*"*")], where and x*"^ denote x with the 
i'th bit set to —1 or 1 respectively. If f : {—1, 1}" — )• {—1, 1} is unate, then Infj(/) = 

Fact 14. Let f = sign(^^^-^ WjXj — wq) be an LTF such that \wi\ > \wi\ for all i G [n]. Then 
|Infi(/)| > |Infj(/)| for all i € [n]. Moreover, for all i G [n] it holds Wi ■ f{i) > 0. 

2.2 Probabilistic Facts 

We require some basic probability results including the standard additive Hoeffding bound: 

Theorem 15. Let Xi, . . . ,X„ be independent random variables such that for each j G [n], Xj is 
supported on [a-j,bj] for some aj,bj G R, aj < bj. Let X = Yl'j=i'^r Then, for any t > 0, 

Pr [\X - E[X]| >t\< 2exp (-2tVE"=i(bi " • 

The Berry-Esseen theorem (see e.g. |Fel68j ) gives explicit error bounds for the Central Limit 
Theorem: 

Theorem 16. (Berry-Esseen) Let Xi, . . . ,Xn be independent random variables satisfying E[Xj] = 
for all i G [n], ^J^^~B[xf] = a, and Y.i E[|^iP] = Ps- Let S = {Xi + ■ ■ ■ + X„,)/cj and let F 
denote the cumulative distribution function (cdf) of S. Then sup^, \F{x) — ^{x)\ < ps/cr^ where <I> 
denotes the cdf of the standard gaussian random variable. 

An easy consequence of the Berry-Esseen theorem is the following fact, which says that a regular 
linear form has good anti-concentration (i.e. it assigns small probability mass to any small interval): 

Fact 17. Let w = {wi, . . . ,Wn) be a r-regular vector in R" and write a to denote \\w\\2. Then for 
any interval [a,b] C R, we have | PrE"^;^ ■WiXj G (a, 6]] — ^{[a/a,b/a])\ < 2t, where ^{[c,d]) = 
$(c?) — $(c). In particular, it follows that Pr [ Y17=i '^i^i ^ ("^j ^]] ^ 1^ ~ ^^l/^ + 2t. 
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2.3 Technical Tools about Regularity and the Critical Index 

The following simple fact states that the "tail weight" of the vector w decreases exponentially prior 
to the critical index: 

Fact 18. For any vector w = {wi, . . . ,Wn) such that \wi\ > • • • > \wn\ > and 1 < a < c{w,t), 
we have da < (1 - t^)^''"^)/^ • ai. 



Proof. If a < c{w,t), then by definition \wa\ > t ■ aa- This implies that fJa+i < \/l — • aa- 
Applying this inequality repeatedly, we get that ctq < (1 — r^)'^'^"^-'/^ -cJi for any 1 < a < c{w, r). □ 

We will also need the following corollary (that appears e.g. as Propositions 31 and 32 in [MORS 10] ). 

Fact 19. Let £{x) = w-x — wq with ||i(7||2 = 1 and wq £U. and f{x) = sign(£(x)). If w is T-regular, 
then we have: 

• (i) E-,^uJf{x)] E^r^xn[f{x)] and 

. (ii)E,^uA\i{x)\]^-E,^M„mx)\], 
where M denotes the standard (i.e. zero mean, unit variance) Gaussian distribution M (0,1). 

2.4 Miscellaneous 

For a, 5 G M we write a & b to indicate that |a — &| < 0{ifj. 

For a vector w G M", we write w^'^^ to denote the (n — A:)-dimensional vector obtained by taking 
the last n — k coordinates of w, i.e. w^'^^ = (wfc+i, • • • , Wn). 

We will use the following elementary fact, which is a direct consequence of Cauchy-Schwarz. 

Fact 20. Let a,6 G with \\a\\2 < 1, \\b\\2 < 1 such that \\a - < r]. Then 



Ml 



Proof. We have that 



i=l 



E (oi - bi) {ai + bi) 



i=l 



2 2 

< \ J2^{ai + bi) •WE(ai-&j) 



< yS-E (a2 + 62).||a-5||2<2V^ 

where the first inequality is Cauchy-Schwarz, the second uses the elementary fact (a + 6)^ < 
2(a^ + 5^), for all a, 6 G M, while the third uses our assumption that ||a||2, ||^||2 ^1- D 



3 Proof of Theorem [9t A "robust" Khintchine-Kahane inequality 

It will be convenient for us to reformulate Theorems [7] and O as follows: Let us say that a unit 
vector w = {wi, . . . ,Wn) S S""^ is proper if vui > Wi+i > for all i G [n — 1]. Then we may 
state the "basic" Khintchine-Kahane inequality with optimal constant. Theorem [71 in the following 
equivalent way: 



Theorem 21 (Khintchine-Kahane inequality, |Sza76j ). Let w G M"' be a proper unit vector. Then 
K(t(;) > I/V2, with equality holding if and only if w = w* {l/y/2, 1/V2, 0, . . . , 0). 
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And we may restate our "robust" Khintchine-Kahane inequality, Theorem [9l as follows: 

Theorem 22 (Robust Khintchine-Kahane inequality). There exists a universal constant c > such 
that the following holds: Let w G M" be a proper unit vector. Then K.{w) > l/\^+c-\\w—w* \\2, where 

w* =^(1/^/2, 1/^2, 0,...,0). 

Before we proceed with the proof of Theorem [22l we give a simple Fourier analytic proof of 
the "basic" Khintchine-Kahane inequality with optimal constant, K.{ut) > l/\/2- We then build on 
this to prove Theorem 1221 

3.1 Warm-up: simple proof that K(w) > 1/a/2 

We consider the function i{x) = where "^^wf = 1 and will show that K.{w) = 

Ex[^(x)] > l/\/2. Noting that E^[(£(x))2] = 1, we have (E[£(j;)])2 = 1 - Var[£], so it suffices 
to show that Var[^] < 1/2. This follows directly by combining the following claims. The first 
bound is an improved Poincare inequality for even functions: 

Fact 23. (Poincare inequality) Let f : {-1, 1}" ^ R be even. Then Var[/] < (1/2) • Inf(/). 
Proof. Since / is even, we have that f{S) = for all 5 with odd l^j. We can thus write 

Inf(/)= E \S\-f^{S)>2. E ?{S) = 2. Y: ?(5) = 2.Var[/]. 

5C[n],|S| even 9j^SC[n],\S\ even 07^SC[n] □ 

The second is an upper bound on the infiuences in £ as a function of the weights: 
Lemma 24. Let i[x) = \ Yl7=i -^'^'^ ^^^2/ ^ ^ ['^]; have Infj(£) < wf. 

Proof Recah that Infi(^) = [Var^^ [^x]]] . We claim that for any x £ {-1, 1}", it holds that 
Var3;J^(x)] < w'^, which yields the lemma. To show this claim we write £{x) = \wiXi + Cj|, where 
'^i — '^jjf^i'^j ' Xj does not depend on Xj. We want to show an upper bound on the quantity 

Since i'^{x) = cj + wf + 2ciWiXi, it follows that Ea;J£^(x)] = cf + vuf. For the expected value by 
definition we can write E^^. [^(x)] = (1/2) • {\wi — Ci\ + \wi + Cj|). To proceed we consider two cases 
based on the relative magnitudes of Cj and Wi. 

If \ci\ < \wi\, we have E^A£{x)] = (1/2) • {sign{wi){vji - q) + sign{wi){wi + Cj)) = \wi\. Hence, 
in this case Var^;. [^(x)] = cf < wf (where the last inequality follows by assumption). If on the other 
hand \ci\ > \wi\, then we have E3;.[^(x)] = (1/2) • (sign(cj)(cj — vui) -|- sign(cj)(ci + Wi)) = \ci\ and 
therefore Var^,. [^(a;)] = wf. Thus, in both cases we have Var^,. [^(x)] < wf as was to be shown. □ 

The bound K.{w) > l/\/2 follows from the above two claims using the fact that £ is even and 
thatEi^- =1- 

3.2 Proof of Theorem [22] 

Let w G M" be a proper unit vector and denote t = \\w — w*\\2. To prove Theorem 1221 one would 
intuitively want to obtain a robust version of the simple Fourier-analytic proof of Theorem [2T] from 
the previous subsection. Recall that the latter proof boils down to the following: 

Var[^] < (1/2) • Inf (^) = (1/2) • £ Inf.(^) < (1/2) ■ t = 1/2 

i=l 1=1 



9 



where the first inequahty is Fact [23] and the second is Lemma [2^ While it is clear that both 
inequalities can be individually tight, one could hope to show that both inequalities cannot be 
tight simultaneously. It turns out that this intuition is not quite true, however it holds if one 
imposes some additional conditions on the weight vector w. The remaining cases for w that do not 
satisfy these conditions can be handled by elementary arguments. 

We first note that without loss of generality we may assume that wi = maxj Wi > 0.3, for 
otherwise Theorem [22] follows directly from the following result of Konig et ah 

Theorem 25 ( |KSTJ99] ). For a proper unit vectorw G M", we have K(tt;) > ^2l'K—(\ — ^2j'K')w\. 

Indeed, if wx < 0.3, the above theorem gives that 

K{w) > 1.3a/2/V - 0.3 > 0.737 > 1/V2 + 3/100 > 1/V2 + (l/50)r, 

where the last inequality follows from the fact that r < \/2 (as both w and w* are unit vectors). 
Hence, we will henceforth assume that wi > 0.3. 

The preceding discussion leads us to the following definition: 

Definition 26 (canonical vector). We say that a proper unit vector w £ is canonical if it 
satisfies the following conditions: 

(a) wi G [0.3,1/^2 + 1/100]; 

(h) T = \\w- w*\\2 > 2/5; 

The following lemma establishes Theorem 1221 for non-canonical vectors: 

Lemma 27. Let w he a proper non-canonical vector. Then K(w) > \/\/2 + (1/1000)t, where 

T = \\w — W* II2. 

The proof of Lemma [27l is elementary, using only basic facts about symmetric random variables, 
but sufficiently long that we give it in Section 13.31 For canonical vectors we show: 

Theorem 28. There exist universal constants ci,C2 > such that: Let w £ M" be canonical. 
Consider the mapping £(x) = \w ■ x\. Then at least one of the following statements is true : 

(1) Infi(£) <u;f-ci; 

(2) W>2[^] > C2. 

This proof is more involved, using Fourier analysis and critical index arguments. We defer 
it to Section 13.41 and proceed now to show that for canonical vectors. Theorem [22] follows from 
Theorem 1281 To see this we argue as follows: Let w G M"" be canonical. We will show that there 
exists a universal constant c > such that K('u;) > \/^/2 + c; as mentioned above, since r < \/2, 
this is sufficient for our purposes. Now recall that 

K(t/;) = [£{x)] = m = Vl- Var[^]. (4) 

In both cases, we will show that there exists a constant c' > such that 

Var[^] < 1/2 - c. (5) 

From this ^ gives 

K(u;) > ^/l/2 + c' = 1/^2 + c 
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where c" > is a universal constant, so to establish Theorem 1221 it suffices to establish ([5]). 

Suppose first that statement (1) of Theorem [28] holds. In this case we exploit the fact that 
Lemma [53] is not tight. We can write 

Var[£] < (1/2) • Inf(/) < (1/2) • (^wj - ci + < (1/2) - ci/2, 

giving Now suppose that statement (2) of Theorem 1281 holds, i.e. at least a C2 fraction of 
the total Fourier mass of i lies above level 2. Since i is even, this is equivalent to the statement 
W-^[^] > C2. In this case, we prove a better upper bound on the variance because Fact [23] is not 
tight. In particular, we have 

Inf(^) > 2W^[e] + 4W^^[e] = 2 (Var[^] - W^^[e]) + 4W^^[l] = 2 Var[^] + 2W^^[e] 

which yields 

Yar[i] < (l/2)Inf (£) - W^^[£] < (1/2) - C2, 
again giving ([5|) as desired. 



3.3 Proof of Lemma] 

We will need the following important claim for the proof of Lemma [27] 

Claim 29. Let X be a symmetric discrete random variable supported on K, i.e. Pr[X = x] = 
Pr[X = —x] for all x G M. Then for all c €M. we have 

max{E[|X|],|c|} < B[\X + c\]. 

Proof. The proof of the claim relies on the following elementary fact whose correctness can be 
checked by a straightforward case analysis. 

Fact 30. For all x, y G M, we have (|x + y| + |x — y\)/2 = max{|a;|, \y\}. 
Let p{x) = Pr[X = x]. We start by proving that E[|X + c|] > |c|. Indeed, 

(|x + c| + I - X + c|) 



B[\X + c\] = Y:Pix)-\X + c\=p{0)-\c\+^ip{x)+p{-x)) 

X x>0 ^ 

> p(0) • |c| + {p{x) + p{—x)) ■ \c\ = \c\ 

x>0 

where the inequality uses Fact [30l Similarly, we show that E[|X + c|] > E[|X|]. Indeed, 

{\x + c\ + \ — X + c\ 



E[|X + c|] = EPix)-\X + c\=p{0)-\c\+ ^ {p{x)+p{-x)) 

X x>0 ^ 

> P(0) • |c| + E (Pi^) +Pi-x)) • kl > E (Pi^) +P{-x)) • \x\ = E[|X|]. 

x>0 x>0 

This completes the proof of the claim. □ 
Proof of Lemma 1271 If -uj is a non-canonical vector, then there are exactly two possibilities : 

Case 1: wi [0.3,1/^2 + 1/100]. In case wi < 0.3, then the calculation following Theorem [25] 
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already gives us that K{w) > 1/^2 + (l/50)r. The other possibihty is that wi > 1/V2 + 1/100. 
In this case, 



K{w) = E 



i=l 



(1/2).E 



1=2 



+ (l/2)-E 



i=2 



^ \wi\ + \wi\ , , 
— n = l^ll 



where the inequahty is an apphcation of Claim [29l As \wi\ > 1/^/2 + 1/100, we get that 

K{w) > l/\/2 + 1/100 > l/\/2 + 1/200 • r 

(using that r < ^/2) . 

Case 2: r < 2/5. Of course, here we can also assume that wi G [0.3, l/\/2 + 1/100] (since 
otherwise. Case 1 proves the claim). We let wi = l/\/2 — a and W2 = 1/V2 — b and ""^ ~ 
By definition, we have that a <b and 6 > 0. Also, 



I * 1 1 2 2 I l2 I 2 

\w — w = a +0 + c . 



Moreover, since w is a unit vector, we have that 



a'^ + b'^ + c'^ = V2{a + b). 
Expanding the expression for K.{w) on xi,X2, we get (recalling that x^^^ = (2:3, . . . ,Xn)) 



(6) 



(7) 



x-(2)e{-i,i}"-2 



\/2 - (a + 6) + • x^^) 



(2)e{-i,i}"-2 



> ^ • fmax{|^/2- (a + 6)|,E[| 

^(2) . x^^^l]} + max{|a — 6|, E[|w^^^ • 



> 



- • ^max 1 1 V2 - 



(a + 6)1, 



V2 



+ max < |a — 6|, 



V2 



where the first inequality follows from Claim [29] and the second inequality uses the fact E[|^«(^) 
x^'^^l] > c/y/2 (as follows from Theorem [7|). We consider two further sub-cases : 

(^ase 2(a): Let > Then, we can bound the right hand-side from below as follows: 

(^max||^/2- (a + 6)1,-^1 + max ||a-5|,-^|) > i . |^| ^2 - (a + 6)| + 



1 

> - 
- 2 



V2 



+ 



1 



r r 

+ 



where the second inequality uses ((Tj). As long as r < 2/5, it is easy to check that 

^2 



T T 

> 



40 2V2 - 1000 
which proves the assertion in this case. 

Case 2(b): Let < r^/20. In this case, we will prove a lower bound on |a — 5|. Using < r^/20 
and dH), we have + 6^ > (19t^)/20. Also, using d?]), we have a + b = t'^/\/2. We now have 



(a - by = 2{a' + b')-{a + by >2- — -T 



19 2 l! > 1^ 2 

2 - 10^ 



T 9 
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The last inequality uses r < 2/5. Now, as in Case 2(a), we have 

(^max||^/2-(a + 6)|,-^|+max||a-6|,-^|) > 1 . (^| ^2 - (a + 6)| + |a - 6|) 



1 

> - 
- 2 



V2 



V2 



r 1 r 1 r 

+ 77 = ^ ^ + - > - + 



2 ^ 2^/2 2 - 2 1000 



Again, the last inequality uses that r < 2/5. This finishes the proof of Lemma [271 □ 
3.4 Proof of Theorem [28] 

We will prove that if w G M" is a canonical vector such that Infi(£) > u^f — ci, then W-^^[£] > C2. 
For the sake of intuition, we start by providing a proof sketch for the special case that ci = 0. At 
a high-level, the actual proof will be a robust version of this sketch using the notion of the critical 
index to make the simple arguments for the "ci = case" robust. For this case, it suffices to prove 
the following implication: 

If Infi(£) = w\, then at least a constant fraction of the Fourier weight of I lies above 
level 2. 

Indeed, we have the following claims: 

(1) Let w be canonical and Infi(£) = wl. Then w equals {wi^ . . . , loi, 0, . . . , 0) where there are k 
repetitions of wi and k is even. We call such a w "good". 

(2) Let tt; be a good vector. Then I has 0(1) Fourier weight above level 2. 

We can prove (1) as follows. Suppose that Infi(£) = w\. Then, as implied by the proof of 
Lemma [Ml every outcome p^^-* of {x2, ■ ■ ■ ,Xn) has \w^'^'> ■ p^'^'^\ > wi. Suppose, for the sake of 
contradiction, that some coordinate wj is neither equal to wi nor to 0. Let {k > 2) be the first 
such value. By having p2, ■ ■ ■ , Pk~i alternate between +1 and —1 we can ensure that there is an 
assignment of p2, ■ ■ ■ , Pk-i such that W2P2 + • • • + w^-iPk-i is either (if k is even) or wi (if k is 
odd). In the former case, by choosing the remaining p bits appropriately we get that there exists 
an assignment p such that \w^'^^ ■ p^'^^\ <Wk <wi^ where the inequality uses the fact that the WiS 
are non- increasing and our assumption that ^ wi. In the latter case, if is the last nonzero 
entry, for an appropriate p, we can get {w^"^^ ■ p^'^^ = wi — Wk < wi. Otherwise, if there are other 
nonzero entries beyond Wk we can similarly get Iw^"^^ ■ p^^^l < w^- So we have argued that if there is 
any Wk ^ {0, wi} then it cannot be the case that Infi(^) = wf, so w must be of the form (k copies 
of Wl followed by O's). If k is odd, then clearly there exists a p such that l^^^^ • p^^^l = 0. So, it 
must be the case that k is even. This proves (1). Given (1) in hand, we may conclude (2) using 
the following lemma (Lemma 13 ip and the observation that since wi > 0.3 we must have k < 12: 



Lemma 31. Let ik{x) 



(xi+...+a'fc) 
Vk 



Fork>4 and even, W^'^[i] > 



Proof. We start by observing that because £k{x) only takes values which are integral multiples of 
it must be the case that for any character xs, the value ik{S) = ^[xs{x) -ikix)] is a multiple 
of 2-^= • A;-^/2_ Hence, any non-zero Fourier coefficient of ik is at least 2 ■ k ^'^ in magnitude. 
Thus, if W-^[£] 7^ 0, then W-^[£] > k~^2~'^^ . Thus, to prove the lemma, we need to show that 

Next, we observe that ^fc(x) is an even function and hence any Fourier coefficient f{S) = if IS"! 
is odd. Thus, towards a contradiction, if we assume that W-^[^] = 0, then the Fourier expansion of 
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^fc(x) must consist solely of a constant term and degree 2 terms. As the function Ikix) is symmetric, 
we may let the coefficient of any quadratic term be a and the constant term be /3, and we have 



ik{x) = (3 + y a ■ XiXj = (3 + a ■ I y XiXj | = /3 




ak 



o-(^Ea;iJ +/3- — =7i(^gx,J +72 

where 71 = a/2 and 72 = /3 — Note that since k is even, there exist assignments x G {—1, l}'^ 
that cause X^jLi a^j to take any even value in [—k, k]; in particular, since k > A, the sum "^^^i Xi 
may take any of the values 0,2,4. 

Now, if X]i=i^« — 0) then £k{x) = 0. Hence we infer that 72 = 0. If X^jLi a^j = 2 then 
4(2;) = 2/\/fc, and if 2;j = 4 then 4(x) = 4/y/k. Clearly, there is no 71 satisfying both 

71 • 2^ = 2/y/k and 71 • 4^ = A/y/k. This gives a contradiction. Hence W-^[-^] 7^ and the lemma 
is proved. □ 



We can now proceed with the formal proof of Theorem [28j We will need several facts and inter- 
mediate lemmas. The first few facts show some easy concentration properties for weighted linear 
combinations of random signs under certain conditions on the weights. 

Claim 32. Fix a > 0. Let wi, . . . , Wn S K satisfy \wi\ < a for all i. Then there exists x* £ {—1, 1}" 
such that w ■ X* £ [0, a] (and clearly —x* G { — 1, 1}" has w ■ {—x*) G [— a,0]J. 

Proof. Construct x' £ { — 1,1}" one bit at a time, by choosing x'^_^_^ so that sign(u7j_|_ix^_j_]^) — 
— sign(it;ix']^ + • • • + Wix'^. The resulting vector x' satisfies {w • x'\ < a. □ 

As a special case of this we get: 

Claim 33. Fix < r] < a. Let Wj G j G [2k + 1], satisfy Wj £ [a — rj,a]. Then, there exists 
X* = (x*, . . . G {-1, 1}2^+1 such that: X^^fc+i ^^.3.* ^ [q^q]. 

The following claim is only slightly less immediate: 

Claim 34. Fix < rj < a. Let wj £ , j £ [2k], satisfy wj £ [a — ri,a]. Then, there exists 
X* = {xl, . . . ,4^) G {-1, l}^'^ such that: Y.f=i WjX* £ [0,r?]. 

Proof. The vector n G M'^ defined by uj = W2j — W2j^i has [uj[ < rj for all j £ [k]. It is clear that 
the set of values {w ■ a^}xe{-i,i}2fe is contained in {u ■ x}^^^_i iyk. The claim follows by applying 
Claim [32] to -u. ' ' □ 

We will also need the following corollary of the Berry-Esseen theorem (more precisely, it follows 
from Fact [T7] together with the fact that the pdf of a standard Gaussian has value at least 0.2 
everywhere on [—1, 1]): 

Fact 35. FixO < r < 1/15. Letw £ M" be t -regular with [[w[[2 < 1. Then, Prx[0 < w-x < 15r] > r 
and Pra;[— 15t < w ■ x < 0] > t. 

We are now ready to prove the following lemma which establishes a concentration statement 
for linear forms with a given maximum coefficient: 
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Lemma 36. Let w S M" be proper with \\w\\2 < 1 and let 6 = wi > 0. There exists k = k{6) such 
that Pr^[0 < w ■ X < 6] > K. 

Proof. We choose a sufficiently small r > 0, where r = t{6) <^ 6, and consider the r-critical index 
K = c{w,t) of w. Fix Kq = 0(1/t^) • log(l/(5^) and consider the following two cases: 

[Case \: K < Kq.\ In this case, we partition [n] into the head H = [K — 1] and the tail T = [n]\H. 
Then, an application of Claims [M] and [33] for r] = a = 5 gives us that 

Pr,^ [WH ■ XH G [0, 6]] > 2-'^ > 2-^». 

An application of Fact [35] for the r-regular tail gives that Pr^.^, [wt • xt G [— 15r, 0]] > r and 
combining the above inequalities using independence yields 

Pr^. [wx£ [-15r, 5]] > 2"^° • r. 

Now note that for any choice of r < 6/15, the above clearly implies Pr^; [w ■ x ^ [—5, 5]] > 2~^° ■ r 
and by symmetry we conclude that 

Pr^ [wx£ [0,5]] > 2-^°-^-T 

yielding the lemma for ki = 2"^'""^ • r. 

[Case 2: L > Kq.] In this case, we partition [n] into H = [Kq — 1] and the tail T = [n]\ H. We 
similarly have that 

Pr,^ [WH-XH G [0,6]] > 2-^0. 

Now recall that the tail weight decreases geometrically up to the critical index; in particular, Fact [18] 
gives that ||wt||2 ^ 6'^. Then, for a sufficiently small 6, the Hoeffding bound gives 

Pr^^ [wT-xre [-6,0]] > 1/4. 
Combining these inequalities we thus get that 

Pr^ [wxe [-6,6]] > 2--^^«-2. 
By symmetry, we get the desired inequality for K2 = 2~^o-3_ 

The proof follows by selecting k = min{Ki, K.2} = ki for any choice of r < 5/15. □ 

Corollary 37. For any 6 > 0, there is a value k = k[6) > such that for any w G M" with 
\\'w[[2 < 1 and lltt'lloo ^ 6, Pr2:[0 < w ■ x < 6] > n and Y'Tx[—6 < w ■ x <0]'> k. 

Proof. We start by considering the case when 111^112 < 5/100. In this case, by Theorem 1151 we 
certainly get that Pra;[|w ■ x\ < 6] > 99/100. Hence, by symmetry, 'Prx[—6 < w ■ x <0] > 99/200 
and Pr^[0<wx<6]> 99/200. 

Next, we consider the case when [[w[[2 > 5/1500. In this case, if wi > 5^/1500, then we apply 
Lemma [Ml to get that Pr^^ [0 < w ■ x < 5^/1500] > Ki and (by symmetry) Pr^; [-5^/1500 < w • X <■ 
0] > Ki where ki is a positive constant dependent only on 6. 

The only remaining case is when wi < 5^/1500. In this case, the vector w is 5/15-regular. Now, 
we can apply Fact [35]to get that Pra,[0 < w ■ x < 6] > 5/15 and Prx[—6 < w ■ x < 0] > 5/15. By 
taking k = min{5/15, ki, 99/200}, the proof is completed. □ 

Using the above corollary, we show the following lemma: 
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Lemma 38. Let a,ri,^ G M+ with w eMJ^ be such that ^ < \\w\\2 < 1, a > 27], and \\ 
Then, there are positive constants k = K{a, r], ^) and 7 = 7(0, rj, ^) such that 

Prx[—2a + 2r] < w ■ X < —7] > k. 

Proof. We choose a sufficiently small C > ^-iid consider two cases. 

[Case 1: w is (^-regular.] In this case, Theorem [T6l gives us (similar to Fact [35]) that for ^ < 1/20, 
we have 

Pr^[-20C • \\w\\ <wx<-C - \\w\\] > C- 

[Case 2: w is not ("-regular.] We assume without loss of generality that wi = \\w\\oo- In this 
case, it follows by definition that wi hence wi € a — rj]. Since \wj\ < a — for all 

j > 2, Corollary [37l says that (recall that w^^^ = {w2, ■ ■ ■ ,Wn)) 

Pr^(i) -a + 77 < w^^) • < > 0(0,77). 

By independence we thus get 

Pr^ [-2a + 2?7 < w • X < -( ■ ^] >c{a,r])/2. 

Combining Case 1 and Case 2 and using 1 > \\w\\ > we get 

Pr^ [min{-2a + 2r/, -20C} < w ■ x < -( ■ > min{c(a, r/)/2, C}. 

We now choose C > so that 20^ < a — rj. Finally, we set 7 = C • C and k = min{c(a, r/)/2, (} and 
get the claimed result. □ 

The next lemma is a robust version of Lemma [3T1 It says that if a vector w of length n is very 
close to having its first 2k entries each being a and its remaining entries all 0, then l{x) = \w ■ x\ 
must have nonnegligible Fourier mass at levels 4 and above. 

Lemma 39. Let a > 0, w G and A; G N, k > 1. Then there are sufficiently small positive 

constants i] = r]{k), r = r(/c) with the following property : Lf for every i S [2A;], we have Wi E 
[a — ri,a] and Ej>2fc(^i)^ — ^^^'^ ''^'^P ^ '■ x ^ \w ■ x\ satisfies W-^[^] > 7 for some 
7 = j{k) > 0. 

Proof. Consider the vector w' = (a, . . . , a, 0, . . . , 0) and the map i' : x \w' ■ x\. We have 

2k 

Xi + ... + Xk 



e'{x) = a-Vk- 



Vk 



By applying Lemma \3T\ we get W- [£'] > a ■ 2 . Note that if rj and r are sufficiently small 
then clearly a > This implies W-^[^'] > ^4^. 
We now observe that 



\i{x)-i'{x)\ = 


n 


n 




k 


k 




n 




1 E • Xi\ 


< 




1 E • 


+ 


E '^i^i 




i=l 


i=l 






4 = 1 




j=k+l 



Let us use hi{x) = \ E(Li Wi-Xi\, h2{x) = \ Ei=i ^'^'^ ^zix) = \ Ej=fc+i WiXi\. Then we may 

rewrite the above as \(.{x) — l'{x)\ < |/ii(x) — /i2(x)| + h^{x). This implies that \i{x) — /(x)p < 
2{hi{x) - h2{x)f + 2{h3{x)f. This in turn yields 

E[(^(x) - i'{x)f] < 2E[{hi{x) - h2{x)f] + 2E[{hs{x)f]. 
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Note that E[{hs{x))'^] = E"=fc+i < t'^- Next, observe that 



\hi{x) - /i2(x)| 



i=l 



1=1 



< 



1=1 



Hence, we get that E[(/ii(x) - h2{x)f] < E[{Ei=iiwi - w'^) ■ Xif] < ^^L^ r]^ = krf . 

Combining these bounds, we get that E[(^(x) — ^'(x))^] < 2{krf' + r^). Hence, we have that 



-2k 



Ak 



We may choose r] and r smah enough so that 



4fc 



-likif + r"^) > 



2{kr]^+T^). 



and the proof is finished. □ 



Given the above lemmas, the proof of Theorem [28] proceeds as foHows: Let wi = a and 
rj = r]{a) > be a sufficiently small constant. Let L be the first index such that wl < a — rj. Since 
wi > 0.3 and \\w\\ = 1, it is clear that L < 1/0.09 < 12. We now consider two cases : 

[Case I: L is even] Then by Claim [3^ there is a choice of X2, • • • ,xl-i, such that X^^=2 w^x^ G 
[— r/, 0]. Using Corollary [37] and noting that wl < a — rj, there is some k = K{a,T]) such that 
Pr^(i-i)[0 < w^-'^~^^ x^^~^^ < a — rj] > K. By independence, we thus get 



(8) 



Note that dSj) implies (by definition) that Infi(^) < wf — ci, for an appropriate constant ci = 
ci{K,L,r]) > 0. 

[Case II: L is odd] Let us choose a sufficiently small ^ > 0. If Hu^^^"^-* II2 > then observe that 
from Claim [M] (applied to the weights wi, . . . ,wl-i) there is a choice of X2, ■ ■ ■ ,xl-i satisfying 
J2k=2 '^kXk G [a - a], i.e. 



Pr 



L-l 

a-'n< Yj WkXk < a 

k=2 



> 2" 



Combining this with Lemma [38l applied to w^^ ^\ we get that 

Pr^(i) [-a + !]< w(^) • x(^) < a - 7(0, ?/, ^)] > 2"^ 



K. 



(9) 



Exactly as before, ([9]) implies (by definition) that Infi(£) < wf — ci, for an appropriate constant 
ci > 0. 

Now consider the only remaining case which is that ||w^^^^^||2 ^ Recall that 1 < L < 12 and 
L is odd; we first claim that that L > 3. Indeed, this must be the case because L = 3 contradicts 
(for ^ and 77 sufficiently small) the assumption r > 2/5. Now, since ^ < 11 and rj and ^ are 
sufficiently small, by applying Lemma [39|) . we get that £ has a constant fraction of its Fourier mass 
above level 2, completing the proof. This finishes the proof of Theorem [ 



4 Proof of Theorem [3] using Theorem [9] 

As described at the start of Section[3l it suffices to prove Theorem [22l We first observe that it suffices 
to prove the theorem for balanced LTFs, i.e. LTFs / : {-1, 1}" {-1, 1} with /(0) = E[/] = 0. 
(Note that any balanced LTF can be represented with a threshold of 0, i.e. f{x) = sign(?i; • x) 
for some w S M".) To see this, let / : { — 1,1}"' — >• {—1,1} be an arbitrary n-variable threshold 
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function, i.e. f{x) = sign(tMo + X^ILi^*^*)' ^^^^ note that we may assume that wq ^ w ■ x for all 
X £ { — 1, 1}". Consider the {n + l)-variable balanced LTF g : {x,y) — )• {—1, 1}, where y S {—1, 1}, 
defined by g{x, y) = sigii{wQy + X]"^;^ WiXi). Then it is easy to see that g{y) = E[/] and g{i) = f{i) 
for all ie[n]. Therefore, W^'^[f] = W^lg] = W^'^[g]. 

Let / = sign(?i;-x) be an LTF. We may assume that w is a proper unit vector, i.e. that \\w\\2 = 1 
and Wi > Wi+i > for i G [n — 1]. We can also assume that w ■ x ^ for all x G {—1, !}"• We 
distinguish two cases: If w is "far" from w* (i.e. the worst-case vector for the Khintchine-Kahane 
inequality) , the desired statement follows immediately from our robust inequality (Theorem [9]) . 
For the complementary case, we use a separate argument that exploits the structure of w. More 
formally, we have the following two cases: 

Let r > be a sufficiently small universal constant, to be specified. 

[Case I: — II2 > t"]- In this case. Proposition [8] and Theorem [9] give us 

W^[/] > iK{w)f > {I/V2 + erf > 1/2 + V2CT 

which completes the proof of Theorem [3] for Case I. 

[Case II: — w)*||2 < t]- In this case the idea is to consider the restrictions of / obtained by fixing 
the variables xi,X2 and argue based on their bias. Recall that for a vector y = (yi, . . . , G M" 
and i G [n] we denote y^*) = (yj+i, . . . ,yn)- We consider the restrictions fij : { — 1, — )• {—1, 1} 
defined by 

hjiy) = sign{wi ■ i-iy + W2 ■ i-iy + w^^^ • y). 
We fix A = 3/4 and consider the following two subcases: 

(a) (Ey[/oi(y)] < A) In this case the function /oi is not very positively biased; we show that the 
Cauchy-Schwarz inequality is not tight. In particular, the degree-1 Fourier vector (/(i))i=i,...,n 
of f{x) = sign(iti • x) and the corresponding weight-vector w form an angle bounded away 
from zero: 

Lemma 40. There are universal constants t,k = k(t) > such that the following holds: 
Let w G M" be any proper unit vector such that \\w — w*\\2 < t and Ey[/oi(y)] < A where 
f{x) = sign(u; • x). Then we have 

Wi[/]>(l + K).(K(u;))^ 

Proof. Note that since wi > W2 the function /oi(y) is an LTF of the form sign(it;'^^^ • y^^^ + 6) 
with ^ > 0, and hence E[/oi] > 0. To deal with this case we recall the following simple fact: 

Fact 41 (Lemma 2.4 in [OSllQ . Let f : {-1, 1}" {-1, 1} be an LTF with 1 - \ E[/]| = p. 
Then W^lf] >p^/2. 

An application of Fact [41] for /oi gives 

WH/oi] > 1/(32). 

Note that by symmetry we also have that Ey[/io(y)] > — A and therefore 

Wi[/io] > 1/(32). 
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Fix k £ {3, ... ,n}. We have that 

f{k) = Inf,(/) = (1/4) • E Inffe_2(/ii) > (1/4) • (hi{k - 2) + ho{k - 2] 

ije{o,i} ^ 

Since the sign of foiik — 2) agrees with the sign of /io(fc — 2) for all k € {3, . . . ,n}, we get 
that 

E /(^)' > (1/16) • (Wi[/oi] + Wi[/io]) > 1/(256). 

fc=3 



Recall that by assumption of the lemma it holds ||'(i'^^^||2 = y'Zl7=3'^i — Parseval's 
identity imphes that ^"=1 /^(^)^ < 1. We can therefore now write 



K{w) = Emw^ < Jwi+wi^jp{i)+p{2) + \\w^'^\\2-JEhkr 

i=i ^ y k=3 

< VWM/]- 1/(256) +r 

where the first inequality follows by two applications of Cauchy-Schwarz and the second 
follows by our assumptions. By squaring and expanding, assuming that r > is sufficiently 
small, we obtain 

{K{w)f < W^[/]- 1/300 

< W^f] - (1/300) W^[/] = (299/300) • W^[/] 

where the second inequality follows from the straightforward fact that W^[/] < 1. This 
proves Lemma HOl □ 

Theorem [3] follows easily from Lemma |30] in this subcase using the "basic" Khintchine in- 
equality with optimal constant, (K.{w))'^ > 1/2. We turn now to the remaining subcase: 

(b) (E.y [/oi(y)] > A = 3/4) In this case, we show that the value /(I) is so large that it alone causes 
W-^[/] to be significantly larger than 1/2. Since Ej,[/oi(y)] > 3/4 it must certainly also be 
the case that By[foo{y)] > 3/4, and by symmetry Ey[fio{y)] < -3/4 and By[fu{y)] < -3/4. 
Consequently we have /(I) = B.^[f{x)xi] > 3/4, and so W^i[/] > /(l)^ > 9/16. 

This concludes the proof of Theorem [31 



5 Alternate proof of Theorem [3] 

Recall that it suffices to prove the theorem for balanced LTFs. The idea of the second proof is to 
perturb the original halfspace slightly so that the perturbed halfspace is defined by a sufficiently 
anti-concentrated linear form w' ■ x. If the perturbed halfspace is regular, one can show that its 
degree-1 Fourier weight is close to 2/7r. Otherwise, there exists a large weight, hence an influential 
variable xi (say). We are then able to show a non-trivial upper bound on the influence of xi on 
the function ^{x) = \ w' ■ x\. 

We require the following terminology: 

Definition 42. The (relative) Hamming distance between two Boolean functions f,g : { — 1, 1}" — s- 

def 

{ — 1, 1} is defined as follows: dist(/, g) = Prx[f{x) 7^ g{x)]. If dist{f, g) < e we say that f and g 
are e-close. 
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Definition 43. Let f : {-1,1}" {-1,1} be an LTF, f{x) = sign(u'o + 

i^iXi), where 

the weights are scaled so that "^I^QUif = 1. Given a particular input x G { — 1,1}" we define 
marg(/,2;) = \wo + Y17=i ■ 

We start by recalling the following result from [OSllj which essentially says that any LTF is 
extremely close to another LTF for which almost all points have large margin: 

Theorem 44. [Theorem 6.1 in WSllf l Let f : {-1,1}" {-1,1} be any LTF and let < r < 1/2. 
Then there is an LTF f : { — 1, 1}" — ?• { — 1, 1} with dist(/, /') < 77(r) satisfying Pr2,[marg(/', x) < 
k(t)] < T, where k{t) = 2-0(i°g'(i/T)/r2) ^^^^ ^ 2-i/«M. 

Let < r < (5 be sufficiently small universal constants (to be chosen later). Given any balanced 
LTF f{x) = sign{w-x), we consider the LTF /' = sign(7i;'-3;), ||ti''||2 = 1, obtained from Theorem 1441 
so dist(/, /') < r/(T) and Pra;[|t(;' • x\ < k{t)] < r. We will exploit the anti-concentration of w' ■ x to 
establish the theorem for /'. We will then use the fact that / and /' are close in Hamming distance 
to complete the theorem. 

We apply Fact 1201 for the degree-1 Fourier vectors of / and i.e. aj = f{i) and bi = f'{i), 
i G [n]. Note that Parseval's identity gives that X]iLi(/(0)^ ^ 1 ^'^d J2i=iif' i'^))'^ — 1- Moreover, 
Plancherel's identity implies that 

E im - f'{i)f < E ifiS) - f'{S)f = E4(/(x) - f'{x)f] = 4dist(/, /') < 4r?. 

i=l SC[n] 

Therefore, 

|Wi[/]-Wi[/']| <4^. (10) 

Therefore, Fact [20] gives that 

The above equation implies that if we show the theorem for /' we are done as long as r] is 
sufficiently small. We can guarantee this by making r sufficiently small. To show the theorem for 
/', we consider two possibilities depending on whether the vector w' defining /' is 5-regular (where 
6 will be determined later). 

[Case I: w' is iJ-regular] In this case, we use the following result from [MORS 10] : 

Theorem 45 (Theorem 48 in [MORSIO] ). Let S > be a sufficiently small universal constant and 
f be a 6-regular LTF. Then \'W^[f] - W{^[f])\ < S'^/^. 

We give a full description of the W{-) function in Section [6.11 here we only will use the fact 
that W{0) = 2/tt. Theorem [m thus gives that W^[f'] > | - S^^^ and by ([TO]) we obtain 

Wi[/]>--5^/^-4V^. (11) 

TT 

This quantity can be made arbitrarily close to 2/7r by selecting 6, r to be small enough constants. 

[Case II: w' is not 5-regular] In this case, we have that \wi\ = maxj > 5. Let us assume 
without loss of generality that w[ > 0. (The other case is entirely similar.) By Proposition [8] and 
Fact [23] we have 

W^[/'] > {K{w'))^ = 1 - Var(£) > 1 - (1/2) • Inf(£), 

where i{x) = \w' ■ x\. Lemma [Ml already implies that Var[£] < 1/2, but we are able to prove a 
better upper bound in this case. To prove a better upper bound on the variance, we exploit that 
w'l > S toupper bound Infi(£) by a quantity strictly smaller than {w[)'^. For this, we recall the 
following result from [MORSIO] : 
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Theorem 46 (Theorem 39 in jMORSlO] ). Let f{x) = sign(^"^-^ WiXi — wq) be an LTF such that 

^iwf = 1 and 6 \wi\ > \wi\ for all i G [n\. Let < e < 1 he such that \ E[/]| < 1 — e. Then 
\f{l)\=^l[5eHog(l/e)). 

We can now state and prove our main lemma for this case: 

Lemma 47. In the context of Case II, we have Infi(£) < {w[f - 2(/'(l) - 2t)k{t)w[ + (/'(I) - 
2t)k(t)2. 

Proof. Since w[ > 6 and Y17=ii'^i)'^ ~ ™ apphcation of Theorem H6l gives Infi(/') = /'(I) > 
ci ■ w'^, where ci is a universal constant. 

To analyze the desired quantity, we partition the hypercube {—1,1}" into pairs {x~^,x~) that 
differ only in the fist coordinate with xf = l and x^ = —1. That is x"*" = (l,y) and x~ = (— l,y) 
with y G {—1, 1}"~^. We say that such a pair is "good" if both the following conditions hold: (1) 
the corresponding hypercube edge is bi-chromatic (i.e. f'{x^) = 1 and f'{x~) = — ijl, and (2) 
min{|i(;' • \w' - a;"!} > k(t). It is easy to see that the fraction of pairs that are "good" is at least 
7'(1) - 2r, i.e. Pr^e|„i,i}n-i [g] > /'(I) - 2t, where G is the event g = {y e {-1, l}"-i | the 
pair (l,y), (— l,y) is good}. Indeed, the probability that the edge (l,y), (— l,y) is monochromatic 
is 1 — Infi(/') = 1 — /'(I) and the probability that either \w' ■ x'^\ < k{t) or \w' ■ x~ \ < k{t) is at 
most r, hence the claim follows by a union bound. 

Now if y G {~1) 1}"~^ is such that the corresponding pair x^ = (1, y) and x~ = (—1, y) is good, 
we have that = w'l+c' > k{t) and = w'l—c' > k,{t), where c' = (^2, • • • ■,w'j^)-y. From 

this we deduce that |c'| < \w'^ — k(t)| < \w'i\, where the second inequality holds for a sufficiently 
small choice of r. Hence, the analysis of Lemma [2^ yields that in this case Var[^(a;i, y)] = c'^ < 
{w'l — k{t)) . In all other cases. Lemma [Ml vields the upper bound Var[£(xi,y)] = c'^ < {w[)^. We 
can thus bound from above the desired influence as follows: 

Infi(£) = E,g|„i^i|„-i[Var[^(xi,y)]] 

< (/'(I) - 2r) • {w[ - K(r))' + (1 - /'(I) + 2T)iw[f 

< iw[f - 2(7'(1) - 2t)k{t)w[ + (7'(1) - 2r)K(r)2. 

This completes the proof. □ 
Combining Lemma [47] with our earlier arguments, we obtain 

w'[/i2i+(?(i)-2.w.w-mz|iwi)! 

and using pO]) we conclude 

[/]>! + (7(1) - 2rHr)w[ - (/^(l) - 2r).(r)^ _ ^^^^ 

At this point it is straightforward to complete the proof of Theorem [3l Indeed, we select 5 > 

to be a sufficiently small constant and t'= ci ■ 6/4 <^ 6. First, note that the bound of (jlip for the 
regular case can be made arbitrarily close to 2/7r. Regarding the bound of ()12p for the non-regular 
case observe that 

7(1) - 2r > ci6 - ci(5/2 = ci6/2 
^This is the only possibility since w'l > 0, hence /' is monotone nondecreasing in xi. 
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which means that the advantage over 1/2 is at least 



(1/2) • ci5''k{t) - (1/4) • ci5K(r)2 - Ay/^ 

which is lower bounded by a universal positive constant, since the second and the third terms are 
negligible compared to the first for our choice of parameters. This concludes the proof of Theorem[3j 

6 Proof of Theorem [4l An approximation algorithm for W-^[LTF] 

Our approach heavily uses Gaussian analysis, so we record some basic definitions and facts that we 
will need below. 

6.1 Gaussian Facts 

Definition 48. We write (j) for the probability density function of a standard (i.e. zero mean, 
unit variance) Gaussian; i.e. <j){t) = (27r)~^/^e~* We denote by J\f (0,1) the corresponding 
distribution and by J\f (0,1)'^ (orM"^) the standard n- dimensional Gaussian distribution. 

Fact 49. (Rotational Invariance) Let U : M" — )■ M" be a unitary transformation, i.e., U^U = I. If 
X ~ AA(0, 1)", then Ux ~ AA(0, 1)". 

Definition 50. Let /ig : M — )■ { — 1, 1} denote the function of one Gaussian random variable x given 
by hg{x) = sign(x — 6). 

Definition 51. The function // : R U {±00} — t- [—1, 1] is defined as fi{6) = E2,^_/^(o,i)[^6'(^)]- 
plicitly, fj,{9) = —1 + 2 (j){x)dx. We note that /x is strictly monotone decreasing, hence invertible 
on [-1,1]. 

Definition 52. The function W : [-1, 1] [0,2/7r] is defined by W{x) = (2(/)(//-i(a;)))2. Equiva- 
lently, W is defined so that W{fi{e)) = {24>{e)f . 

The next two facts appear as Propositions 24 and 25 in [MORS 10] respectively. 
Fact 53. Let X ~ J\f{0, 1). We have: 

. (i)B[\X-e\]=2m-0m, 

• (a) \lJ,'\ < Y^2/7r everywhere and \ W'\ < 1 everywhere, and 

• (Hi) If \u\ = 1 — T], then W{v) = Qijf' log(l/r/)). 

Fact 54. Let f{x) = sign(ii; ■ x — 9) be an LTF such that \\w\\2 = 1. Then 

. (t) f{o)=^E,^x4f{x)] = m, 

• (a) f{i) ''^ 'Ex~j\rAf{^)xi\ = \lw{f{^))wi, for all i G [n], and 

• (n^)Y:=iP{^ = W{fm. 
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6.2 Proof of Theorem H 

We recall the statement of Theorem H) 

Theorem |4l There is an algorithm that, on input an accuracy parameter e > 0, runs in time 
2P°iy(i/'^) and outputs a value such that 

W-^[LTF] < r, < W-i[LTF] + e. 

We recall the simple algorithm used to prove Theorem H] from Section 11.41 

Let K = G(e~^'^). Enumerate all ii'-variable zero-threshold LTFs, and output the value 
Fg =^min{W^[/] : / is a zero-threshold K-variable LTF.}. 

As described in Section 11.41 it suffices to prove that for any zero-threshold n-variable LTF 
f{x) = sign('u; • x), there is a X-variable zero-threshold LTF g, where K = 0(e~^^), such that 

\W'[f]-W'[g]\<e; (13) 

we now proceed with the proof. We can of course assume that n > K, since otherwise ()13p is 
trivially satisfied for g = f with e = 0. 

We choose a parameter 6 = O(e^); as described in Section [L^ the proof is by case analysis on 
the value of the (5-critical index c{w, 6) of the weight vector w. Consider a parameter L = L{6) = 
0((5~^). We consider the following two cases: 

[Case I: Large critical index, i.e. c{w,5) > L{6)] In this case, the proof follows easily from the 
following lemma: 

Lemma 55 (Case 11(a) of Theorem 1 of [SerOTj ). Let f{x) = sign(t(; • x) = sign{wH • xh + wt ■ xt), 
where w is proper, H = [L{5)] and T = [n\ \ H. If c{w,6) > L{S), then f is 5-close in Hamming 
distance to function the junta g{x) = s\gn{wH ■ xh)- 

Since dist(/,c/) < 5, Fact [20] implies that |W^[/] - VJ'^[g\\ < < e. Noting that 5 is a 
zero-threshold i^-variable LTF (since L < K) completes the proof of Case L 

[Case II: Small critical index, i.e. c{w,5) < L{6)] This case requires an elaborate analysis: at 
a high-level we apply a variable reduction technique to obtain a junta g that closely approximates 
the degree-1 Fourier weight of /. Note that there is no guarantee (and it is typically not the case) 
that / and g are close in Hamming distance. Formally, we prove the following theorem: 

Theorem 56. Let f{x) = sigii{w ■ x) = sign{wH ■ xh + wt ■ xt), where w is proper, H = \c{w, 5)] 
and T=[n]\ H. Consider the LTF g : {-1, 1}I^I+^^ ^ {-1, 1}, with M = Q{e~^^), defined by 

( ^ Zi \ 

g{xH,z) = sign whxh + Wwrh ■ E ^= • 
\ 1=1 \J M J 

Then \W\f \ - ^\g\\ < e. 

Note that g depends on \H\ + M < L + M < K variables. Hence, Theorem [56] completes 
the analysis of Case II. We refer the reader to Section 11.41 for intuition and motivation behind 
Theorem [56] and proceed to its proof in the next subsection. 
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6.3 Proof of Theorem 1561 

Let / : {—1, 1}" —5- { — 1, 1} where /(x) = sign(it; • x) = sign^wnxn + wtxt), where the tail vector 
wt is (^-regular. Assume wlog that ||wt||2 = 1- We proceed in the following three steps, which 
together yield Theorem 1561 

Step 1: "Gaussianizing" the tail. First some notation: we write Un to denote the uniform 
distribution over {—1, 1}". Our main result in this case is the following theorem, which roughly says 
that letting tail variables take Gaussian rather than Boolean values does not change the "degree- 1 
Fourier coefficients" by much: 

Theorem 57. Let f = sign{wH ■ xh + wt ■ xt)- For i G [n] define f{i) = Exr^u„[f{x)xi] and 
hi) = E^'s~W|si,XT~A^i^i[/(^)^i]- ^/^T r-regular then 

E (/»-/«) =o{t'/'). 
1=1 ^ ' 

Note that by applying Fact [20] the above theorem implies that W^[/] ss'^^''^^ W^[/], where we 

define Wi[/]'^=^'Er=i(/«)'- 

To prove Theorem [57] we need a few lemmas. Our first lemma shows that for a regular LTF, 
its degree-1 Fourier coefficients are close to its corresponding Hermite coefficients. 

Lemma 58. Let f{x) = sign{w ■ x — wq) be an LTF. For i G [n] define f{i) = Eix£Un[f{x)xi] and 
f{i) =^B,^^n[f{x)xi]. Ifw is T-regular, then Er=i(/(0 " /(^))' = 0(t1/6). 

Proof. We can assume that \\w\\2 = 1. Since w is r-regular, by Fact 1 1 91 (i) we have that /(O) /(O). 
It suffices to show that 

E/«' + E/«'«^ (14) 

i=l 4=1 i=l 

We first note that the lemma follows easily for the case that \wo\ > y21n(2/7^. In this case, 
by an application of the Hoeffding bound (Theorem llSp it follows that |/(0)| > 1 — 2r, hence 
1/(0)1 > 1 - 3r. By Parseval's identity we have EHi /(O' < E<l>^sc[n] fiS? = 1 - f{0? < 4r; 
similarly, in the Gaussian setting, we get EILi /(^)^ — E0^5/('S')^ < 1 — /(O)^ < 6r. Hence, we 

conclude that Er=i(/(0 " Ri)? < 2 Er=i /(^)' + 2 ELi /(^)' = 0(r). 

We now consider the case that |tt;o| < y21n(2/T) and proceed to prove By Fact [5H (iii) we 

get Er=i/(0^ = W{f{0)). Moreover, Theorem [35] gives that EiLi /(^)^ ^(/(O))- We now 

claim that W{f{0)) W{f{0)). This follows from the mean value theorem, since /(O) /(O) 
and \W'\ < 1 everywhere, by Fact [53] Therefore, we conclude that the LHS of (fT4|) satisfies 

E + E fW 2iy(/(o)). 

i=l 1=1 

For the RHS of (I14p we can write 

E fii)fii) = \^W{f{0)) E mf{i) = ^WUm E,ewJ(^i; • x)s\gn{w ■ x - wq)] 
i=i i=i 

where the first equation follows from Fact [53] (ii) and the third is Plancherel's identity. Moreover, 
by definition we have 

^x&Uuiiw ■ x)sign(u; • x - wo)] = B.^(zuJ\w ■ x - wo\] + woE^(zu^[f{x)]. 
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Recalling Fact [19] we deduce that 

'^xmAiw ■ x)s\gn{w ■ X - Wo)] Ri(l"'ol+i)r E^g_^n[(t(; • x)sign(w • x - wq)]. 
Now, the RHS above satisfies 



ExeAT" [{w ■ 3;)sign(u; • x - wq)] = ExeAr[|^ -wo\]+ wo ^x<^M-- [f{x)] = 2(j){wo) = \J W{f{Q)) 

where the first equality follows by definition, the second uses Fact [53] (i) and the third uses the 
definition of 6. Therefore, 



^x^Aiw ■ x)s\gn{w ■ x - wo)] V^(/(0))- 

Since the function W is uniformly bounded from above by 2/7r, we conclude that the RHS of ([1 
satisfies 

n ^ 



i=l 



The proof now follows from the fact that {\wo\ + 1)t < r-*^/^, since I^qI = 0(-^log(l/r)) 



□ 



Our next lemma, a simple generalization of Lemma [58] above, shows that for any LTF, if the 
variables in its tail are replaced by independent standard Gaussians, the corresponding degree 
1-Fourier and Hermite coefficients of the tail variables are very close to each other. 

Lemma 59. Let f = sign{wH ■ xh + wt ■ xt)- For i £ T, define f{i) = 'Exr^u„[fix)xi] and 



m 



E 



.Af\T\[f{x)xi]. IfwT is T-regular, then EiGT(/(^) " /(^))^ = 0{t^/^). 



Proof. Fix an assignment p G {—1, 1}'^' to the variables in H (head coordinates) and consider the 
restriction fp over the coordinates in T, i.e. fp{xT) = s\gn{wH ■ p + wt ■ xt)- For every assignment 
p, the restriction fp is a r-regular LTF (with a different threshold) ; hence Lemma [58] yields that 
for all /3 G {—1, 1}'^' we have 



Hence, we obtain 



i&T 



i&T 



E (/p«-/p«)' = o(r 



l/6^ 



(15) 



/p(0 - fpii) 



< E 



E 



E ifpii) - fpii 



0(r 



l/6^ 



where the first equality uses the definition of the Fourier /Hermite coefficients, the first inequality 
follows from Jensen's inequality for each summand, the second equality follows by linearity and the 
last equality uses (fT5]) . □ 



Replacing the Boolean tail variables by Gaussians alters the Fourier coefficients of the head 
variables as well. Our next lemma shows that the corresponding change is bounded in terms of the 
regularity of the tail. 
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Lemma 60. Let f = sign{wH • xh + wt ■ ^t)- For i £ H define f{i) = Ex,~^u„[f{x)xi] and 
/(^) = E^'H~W|if|,XT~A^ii'i[/(^)^i]- ^f^T is T-regular, then Y^ieHUi^) - R^)? = ^(t^)- 

Proof. We define the functions /' : {-1, [-1, 1] and /" : {-1, [-1, 1] as follows : 

f{xH) = ^XT(iU\T\ [fixH^xr)] and f"{xH) = E^^g_^|T| [Hxh^xt)] ■ 
By definition, for all i G it holds f'{i) = f{i) and f"{i) = f{i)- We can therefore write 

E m-mf = E {f'ii)-rii)f < e {f'is)-r{s)f = E.e^/|«,(/'(x)-/"(x))2 < ii/-riiL 

ieH i&H SCH 

where the second equality is Parseval's identity and the final inequality follows from the monotonic- 
ity of the norms (|| • ||oo denotes the sup- norm of a random variable). 

In order to bound ||/' — /"||oo we exploit the regularity of the tail via the Berry-Esseen theorem. 
Indeed, fix an assignment p G { — 1, l}'^' to xh- Then 

\f'ip) - f"ip)\ < 2 Pr^^g|_;^ ;l||t| [wt-xt + wh ■ P>0]- Pr^^g_^|T| [wt-xt + wh- P> 0] 

Since wt is r-regular, by Fact [T71 the RHS above is bounded from above by 2r. Since this holds 
for any restriction p to the head we conclude that ||/' — /"||oo < 2r as desired. □ 

Theorem 1571 follows by combining Lemmas 1591 and 1601 

Step 2: "Collapsing" the tail. Let F : x M ^ {-1,1} be defined by F{xH,y) = 

sigii{wHXH+y) (recall that we have assumed that w is scaled so that the "tail weight" HtfTlb equals 
1). Fori G i7, we define F(i) = F,^^^u^^^^y^j^(^Q^i)[F{xH,y)xi] andF(y) = E^^^^^^^y^j^^Q^i^[F{xH,y)y]. 

We also denote W^[i<'] = Ej6-ff(-^(^))^ ~^ F{y)^. Our main result for this step is that "collapsing" 
all |T| tail Gaussian variables to a single Gaussian variable does not change the degree- 1 "Fourier 
weight" : 

Theorem 61. We have that Wi[F] = Wi[/]. 

The theorem follows by combining the following two lemmas. 
Lemma 62. For every i G H, f{i) = F{i). 

Proof. The lemma follows straightforwardly by the definitions. Indeed, for every i € H, 

hi) = E^s-Winh^T-A/"!^! [sign(u^/fX/^ + WTXT)Xi] = E^^^U\H\,y^M{0,l)[s^S^{wHXH + y)xi] = F{i) 
where the third equality uses the fact that wt ■ x^ is distributed as A/'(0, 1). □ 
Lemma 63. We have that {F{y)f = Y^iarihi)? ■ 

Proof. This lemma is intuitively clear but we nonetheless give a proof. We need the following 
simple propositions. 

Proposition 64. Leth -.W^ with h G L2{J\f{0, 1)™). Let U : R"^ ^ R"^ be a unitary linear 

transformation. For i G [m], define h[i) = E2;^_/vm [/i(x)xi] and h{i)' = E^j^a/"'" [^(^)(^^)«]- Then, 
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Proof. Let {Ux)i = XlJLi a-ijXj. By linearity, we get that = Yl]Li o-ijhij). Then, 

m m / m \ / m \ 

E Wf = E E 4 Mjf + E E c^k^a,. Mmj) 

i=l j=l \i=l J ii^i \k=l J 

By elementary properties of unitary matrices, we have (i) EilLi '^ij — 1 fo'^ ^ 2^ ^^^^ (ii) EfcLi ^kjO-ki - 

for i 7^ j. Substitution completes the proof. □ 

Proposition 65. Let ^' : M ^ R and $ : M'" M mt/i $ G L2(7VX0, 1)"") de/ine(i as = 
^ iYIlLiWiXi), where x,w eW^ with \\w\\2 = 1. Then YT=iHxif = "^{vf- 

Proof. It is clear there is a unitary matrix U such that U : xi^ EilLi '^i^i- Hence, an application 
of Proposition [Ml gives us 

m m 

E = E {^.^mAH^) ■ {Ux\]f (16) 

i=l i=l 

Now observe that for i > 1, Ea;g_/v^m [<l>(x) • {Ux)i] = as <I>(x) is independent of {Ux)i. Using the 
rotational invariance of the Gaussian measure and using y instead of {Ux)i we deduce 

E^eArm[$(x) • {Ux)i\ = E,yg^(o,i)[(1'(y)y] = ^{y). 

Combining with ()16p completes the proof. □ 

The proof of the lemma follows by a simple application of the above proposition. Indeed, set 
^(y) = Ej;^^fY|^| y)]. An application of Proposition [65] gives us EieT^(^«)^ ~ ^ivY- Now 

note that for i G T, by definition, <I*(xj) = /(i), and ^{y) = F{y). This completes the proof of the 
lemma. □ 

Step 3: "Booleanizing" the tail. Let M = G(e~^^). Consider the LTF g mapping {xh,z) — )■ 
{-1, 1}, where xh G {-1, l}'^' and z G {-1, 1}*^ defined by 

g{xH,z) = sign (^wh ■ xh + {J2 Zi)/VM^ ■ 

In this step we show that replacing the (single) Gaussian tail variable with a scaled sum of 
Boolean variables does not change the degree- 1 "Fourier weight" by much: 

Theorem 66. We have that |Wi[F] - W^[g]\ = OiA-r^/"^^). 

As expected the theorem follows by combining two lemmas, one to deal with the head and one 
for the tail. 

Lemma 67. We have E»6h(^(0 = O(M-i). 

Proof. The proof closely parallels that of Lemma [60l Namely, we will define the function h,h' : 
{-1, [-1, 1] as h{xH) = By^^(^o^i)[F{xH,y)] and h'ixn) = E^^^m [ffC^^/f, -z)]- Note that for 

1 G H, h{i) = F{i) and h'{i) = g{i). As in Lemma[60l we have J2ieHiHi) - h'{i))^ < \\h - 
For any p G { — 1, 1}'^', we can write 

M 

Hp) - h'{p)\ = 2| PTyeX{o,i)[wHP + y > 0] - Pr, ^UmIwhP + E ^r/^ > 0]|. 

1=1 

Theorem 1161 shows that the RHS is bounded from above by 2/v'M, which completes the proof of 
the lemma. □ 
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Lemma 68. Let F and g as defined above. Then \ - (^(y))^l = 0{M-^/'^'^). 

Proof. First note that, by symmetry, for all i,j G [M] we have g{zi) = g{zj). By definition, we can 
write 



g{zi) = E^^ E 



M 



sign [wH-XH + {Yj^i)N^^ 

1=1 



and let us also denote 



M 



sign [wH ■ XH + {Y. 2i)/v M 



Since the tail of g is l/\/M-regular Lemma [59l implies that 



M 



1=1 



Y.{9{zi)-g{zi)f = 0{M-^l^^) 



i=l 



and by Fact | 



M M 

i=l 1=1 



Since F{y) = '^y^M [sign [wh ■ xh + y)], arguments identical to those of Lemma [631 give us 



i=l 



This completes the proof. 



□ 



7 Proof of Theorem [S An approximation algorithm for T 

In this section we prove Theorem [5] (restated below) : 

Theorem [5l There is an algorithm that, on input an accuracy parameter e > 0, runs in time 
22P°'>'(i/^) outputs a value such that 

T(S) < r, < T(S) + e. 



The main structural result required to prove Theorem [S] is the following theorem (recall that 
S"~^ denotes the unit sphere in M"-, i.e. S"^^ = {x G M" : ||rE||2 = 1}): 

Theorem 69. For any e > 0, there is a value = poly(l/e) such that for any n G N, 

T(S"-i) < T(S-^'~i) < T(S"^i) + e. 

As a corollary, we have T(S) < T(S^'^''="i) < T(S) + e. 

Theorem [691 implies that to compute T(S) up to accuracy e, it suffices to compute T(S-'^^~^); 
i.e., we need to compute inf^g§Ke-i T{w). While S^^~^ is a finite-dimensional object, it is an 
(uncountably) infinite set and hence it is not immediately obvious how to compute inf^g§K£-i T{w). 
The next lemma says that this can indeed be computed in time 2*^^^ "'K 

Lemma 70. For any m G N, T(S'"^^) can be computed exactly in time 2'^(^'"). 

Theorem [5] follows by combining Theorem 1691 and Lemma [70l 
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7.1 Proof of Theorem 



Proof of TheoremWfA Let u; G S" ^. For e > 0, we will prove that there exists a value = 0(l/e^) 
and V G such that \T{v) - T(u;)| < e. Clearly, the upper bound on T(S^'= ^) in Theorem [69l 

follows from this. The lower bound on T(S-'^^~^) is obvious. 

To prove the existence of vector v G , we begin by considering the ry-critical index of w 

for rj = e/64. We also let = C • t/rj^ • log{t/rj) where t will be chosen later to be 0(log(l/7?)) and 
C to be a sufficiently large constant. Clearly, for this choice of rj and t, we have that K = 0(l/e^). 
The next two claims show that whether c{w,ri), the r/-critical index of w, is larger or smaller than 
K, the desired vector v exists in either case. 

Claim 71. Let w G be such that c{w,7]) > K. Then there is a vector v G such that 

\T{v) - T{w)\ < rj. 

Claim 72. Let w G S""^ be such that c{w,7]) < K. Then there exists v G g-^^+^W-i such that 
\T:{v) - T(u;)| < 8r/, where A(r/) = A/rj^. 

In both Claim [TU and Claim [72| the final vector v is at most K + X{7j) < 0(l/e^)-dimensional. 
Hence Theorem 1691 follows by choosing = 0(l/e^). □ 

We start by proving Claim [7T1 We will require the following anti-concentration lemma from 

[usn]: 

Lemma 73. (Theorem 4.2 in JOS 111 ) Let w G S"^^ < ?? < 1/2, f > 1 and let K be defined (in 
terms of t and rj) as above. If c{w,rj) > K, then for any wq G M, we have 



Pi"xe{-i,i}" 



WiXi - Wo 



< ■ 



< 2~ 



where uk 



{K)\ 



Remark 74. Lemma^T^ as stated in lOSll^ has the probability bounded by 0(2 *). However, by 
making C large enough, it is obvious that the probability can be made 2~*. 

Proof of Claim pTTj Choose a specific wq (we will fix it later). By Lemma [73l we have that. 



Pr 



a;6{-l,l}'^ 



WiXi - Wo 

i=l 



< Vi- 



< 2" 



:i7) 



Note that Y7=i'^i^i ~ Yi<K '^i^i + Yi>K '^i^i- ''^T denote the "tail weight vector" wt 
{wk+1, ■ ■ ■ ,Wn)- Since ||tfT||2 = ctr, by Hoeffding's inequality (Theorem [T5|l . we have that 



P'^x6{-i,i}"--f^' 
Define the set ^good,«;o follows : 



\wt • Xt\ > 



< 2" 



(18) 



A 



good,ioo 



xG{-l,ir: 



Y WiXi - Wo 
i=l 



> Vt ■ gr and 



Y WiXi 










i>R 
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We next make a couple of observations about the set Agood,«)o- The first is that combining (jl7|) and 
([ISD, we get that Pr^.g|_i [x ^good,u)o] ^ 2^ + 2~*. Second, for every x G ^good,u;o5 we have 



i=l 



> 



WiXi - Wo 

i=l 



Hence for every x E ^good wn > we have 



Now, consider the vector v' G S" defined as foUows: 

• v[ = Wi for 1 < « < ii'; 

• ^^ft'+i = (^K] and 

• = for j > K + 1. 



Note that for every x G {—1,1}" (and hence for every x G ^good,u;o)5 we have YA=i'^i^i 
Yi^i'^i^i- RecalHng that 



K 




K 


> Vi- ax 


Y V-Xi - Wo 




Yj WiXi - Wo 


1=1 




i=l 





,W0 



and that all x £ {—1,1}" satisfy \ Yi>K ''^'i^il — for t sufficiently large we get that every 
X G -^good,KJo satisfies 

Thus, for X G ^good,«;o) we have that all four events coincide: 

2^,^Jl'^x,<wo 2^,^-i^v'.Xi<wo 2^i^-t_WiXi< 



wo 



Likewise, we also get that for x G Agood,woj 



l^,^jV.Xi>W0 }_^.^-^v'^Xi>W0 2^.^-^WiXi>W0 

Now, let S = ^good,i n Agood,-!- We then get that for x G 5, 



ELl WiXi<Wo' 
"'"ELi WiXi>Wo' 



Since Pr^.g|_i i}n [x ^good.ioo] < 2 s +2 * for tt^o G { — 1,1}, as a result we have Pr[2; ^ S] < 
2 • (2~*/s + 2-*). Taking t = 81og(16/r?), we get that Pr[x ^ S] < rj/A. This implies that 



Pr 



x£{~i,iY- 



n 

Y 'WiXt 

i=l 



< 1 



Pr 



x£{-i,iY- 



Y v'i^t 

i=l 



< 1 



< r]/4.. 



Since the final n — K — 1 coordinates of v' are zero, if we simply truncate v' to the first K + 1 
coordinates, we get a vector v G such that 



Pr 



X6{-1,1}" 



Y '^i^i 
1=1 



< 1 



6{-l,l}^+l 



K+1 

Y ^i^i 
i=l 



< 1 



and Claim [7T] is proved. 



□ 
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We next move to the proof of Claim [72j For that, we wih need the fohowing key proposition. 

Proposition 75. Let w,u € S"^^^ be such that wi = Ui for 1 < i < K . Suppose moreover that 

^(K) (^yjK+i, ■ ■ ■ , Wn) CLnd u^^^ {uK+1, • • • , Un) are both rj-regular. Then, for any wq G M, we 
have that 



|Prx6{-i,i}" [w- X <wo]- Pr^e{_i,i}n [u ■ x < wo] \ < 4r/ 



and 



\Pr^(,{_i^iyn[w • X > u;o] - Pr^6{-i,i}" [^^ ■ x > wo]\ < 4??. 
Proof. Consider any fixed setting of variables xi,...,xk G Note that X^^Li Wi^Cj 

J2i<K ^i^i + Y.i>K '^i^i- We have 



Pr 



a;{Jf)6{_l,l}n-x 



E WiXi < Wo 



.1=1 



Pr 



x(^)e{-i,i}"-^ 



i>K i<K 



However, as it;^^) is r/-regular, by Theorem 1 16t we get 



Pr„ 



{X)e{_l,l}n-X 



Likewise, we have 



i>K i<K 



UiXi <Wo-Y '^i^i 
i>K i<K 



Pr 



Pr 



M{Q,\\W^^^\\)<W0- Y W^Xi 
i<K 



Ar{0,\\u^^^\\) <wo- Y UiXi 

i<K 



< 2r]. 



< 2r]. 



As Wi = Ui for 1 < i < i^, we get that 
Pr 



!:(^)e{-i,i}"--^' 



Y '^i^i <Wo-Y '^i^i 
i>K i<K 



Pr„ 



(X)g{_l_l}n-A' 



Y WiXi <Wo-Y '^i^i 
i>K i<K 



< Arj. 



As the above equation is true for any setting of xi, . . . , x^, we get that 

|Prxe{-i,i}" [w ■x<wq\- Pr^g{_i^i}n [u ■ x < wq\\ < Arj. 
The second part of the proposition follows in exactly the same way. 



□ 



Proof of Claim 12. Let w = {wi, . . . ,wk, ■ ■ ■ ,Wn) where K' < K is the ?/-critical index of w. 
Construct a new vector v' such that = Wi for 1 < i < K' . For 1 < j < X{r]) = 
we let v'-_^j = (??/2) • where as before w^^ ^ denotes the (n — i^')-dimensional vector 

{wK'+i, • • • , Wn)- For j > A(7/), we define u-^^ = 0. 

It is clear that v' G S" and that v'^^ •* is ?7-regular. By Proposition [75l we have 



|Pr^,g|_i^i}n[t/; • X < 1] - Pr^g|_^i}n[z;' • x < 1]| < 4?? 



and 



Combining these two, we get 

|Pr^g|_i^i}n [\w • x| < 1] - Pr^,g|_i^i}n • x| < 1] I < Srj. 

As all the coordinates of v' beyond the first K' + coordinates are zero, if we truncate v' to its 
first K' + \{r]) coordinates, we get v G §^ +K'n)~'^ such that 

Pr^'e{-i,i}"[k • a;| < 1] - Pr^g|_i^i}A-'+AW [k ■A<A 
and Claim [72] is proved. □ 
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7.2 Proof of Lemma [70] 



Proof of Lemma \ 7(J[ This proof is based on the decidabihty of the existential theory of the reals. 
In particular, we use the following result due to Renegar |Ren88j . 

Theorem 76. lRen88f There is an algorithm Arcu which, given a set of real polynomials pi, ■ ■ ■ ,Pm. '■ 
— )• R and qi, - ■ ■ ,qk '■ 1^" ^ with rational coefficients, decides whether there exists an x £ R" 
such that 

• Vz € [m], pi{x) > 0, and 

• Wi £ [k], qi{x) > 0. 

If the bit length of all coefficients in all polynomials is at most L and the maximum degree of any 
polynomial is at most d, then the running time of Ajien is L^^^^ ■ ((m + k) ■ d)^^^\ 

The following is an obvious corollary of the above theorem : 

Corollary 77. There is an algorithm which, given a set S C { — 1, 1}'", decides whether there exists 
a vector w G S™"-*^ such that every x £ S has \vu ■ x\ > 1. The algorithm runs in time 2'^^™' \ 

Proof Let po : IR"" ^ ^ be defined as po{w) = Y.'^i wf - 1. For each x£ S, define : R"" ^ R 
to be qx{w) = (Xl^i Wi ■ XiY — 1- Consider the following set of constraints (call it C) : 

• Poiw) > 0. 

• -po{w) > 0. 

• Vx G S, qx{w) > 0. 

Clearly, this set of constraints has a solution if and only if there exists some w € S™~^ such that 

• x| > 1 for all X £ S. Note that each of the polynomials in C is of degree 2 and all coefficients 
have constant-size representations. The total number of constraints in C is \S\ + 2 < 2™ + 2. 
This means that Ajien can decide the feasibility of C in time 2™-^(™) = 2'^('"') which proves the 
claim. □ 

Finally, our algorithm is simply the following : 

• Enumerate over all sets S C { — 1, l}™ in descending order of their size. For each set S, 

— Use the algorithm in Corollary [77] to decide whether there is a vector vu E S'""^ such 
that \u} • x\ > 1 for all x £ S. 

— If there is such a vector w then exit and return 1 — \ S\/2"^~^, else go to the next set S. 

Since the total number of subsets of { — I,!}'" is 2^™ and every step takes only 2'^^™"^ time, the 
total running time is 2'^^^'"). 

To see correctness of the algorithm, fix a vector w* £ S™"^ such that Pr^^^_i^ijm[\uj* • x\ < 
1] = T(S™-i), i.e. Pr^g|_i,i},n[|u'* • x| < 1] < Pr^^{_i^i}m[\w ■ x\ < I] for ah w £ §™-i. We have 
that Pr^^^_i^ijm.[\w* ■ x\ > 1] > Fr^^^_i^ijm[\w ■ x\ > 1]. Since our algorithm enumerates over all 
sets S" C { — 1, 1}™ in decreasing order of their size, its correctness follows. □ 
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